# Panoptic Segmentation

Mask2former Swin Base IN21k Cityscapes Semantic
Other
A Mask2Former model with a Swin-Base backbone (pretrained on ImageNet-21k), trained for semantic segmentation on Cityscapes. Mask2Former handles instance, semantic, and panoptic segmentation with a single architecture.
Image Segmentation Transformers
facebook
329
0
Mask2former Swin Tiny Cityscapes Semantic
Other
Mask2Former is a unified image segmentation framework capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks. This model is based on the Swin-Tiny backbone network and has been fine-tuned for semantic segmentation on the Cityscapes dataset.
Image Segmentation Transformers
facebook
55.98k
3
Mask2former Swin Small Cityscapes Semantic
Other
A Mask2Former model with a Swin-Small backbone, trained for semantic segmentation on the Cityscapes dataset.
Image Segmentation Transformers
facebook
952
2
Mask2former Swin Base IN21k Cityscapes Panoptic
Other
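Mask2Former is a general-purpose image segmentation model based on the Transformer architecture, capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks. This checkpoint uses a Swin-Base backbone (pretrained on ImageNet-21k) and is trained for panoptic segmentation on Cityscapes.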
Image Segmentation Transformers
facebook
140
0
Mask2former Swin Tiny Ade Semantic
Other
Mask2Former is a unified, Transformer-based image segmentation model capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks. This checkpoint uses a Swin-Tiny backbone and is trained for semantic segmentation on ADE20k.
Image Segmentation Transformers
facebook
7,834
1
Mask2former Swin Large Ade Semantic
Other
A Mask2Former model with a Swin-Large backbone, trained for semantic segmentation on the ADE20k dataset. It uses the same unified architecture for instance, semantic, and panoptic segmentation.
Image Segmentation Transformers
facebook
238.92k
15
Mask2former Swin Base IN21k Ade Semantic
Other
Mask2Former is a universal image segmentation model capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks by predicting a set of masks and their corresponding labels.
Image Segmentation Transformers
facebook
879
2
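The entry above describes Mask2Former as predicting a set of masks and their corresponding labels. A minimal sketch of how such a set could be turned into a per-pixel semantic map is shown below. It is written in plain Python on toy data, and the function name `semantic_map`, the input shapes, and the convention that the last class slot means "no object" are assumptions made for illustration, not the library's API.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def semantic_map(class_logits, mask_logits):
    """Combine per-query class scores and masks into one label per pixel.

    class_logits: [Q][C + 1] list; the last slot is assumed to be "no object".
    mask_logits:  [Q][H][W] list of per-pixel mask logits, one mask per query.
    """
    num_queries = len(class_logits)
    num_classes = len(class_logits[0]) - 1
    height, width = len(mask_logits[0]), len(mask_logits[0][0])

    # Class probabilities per query, with the "no object" slot dropped.
    class_probs = [softmax(row)[:num_classes] for row in class_logits]

    labels = []
    for y in range(height):
        row = []
        for x in range(width):
            # Score of class c at this pixel: sum over queries of
            # P(query is class c) * P(pixel belongs to the query's mask).
            scores = [0.0] * num_classes
            for q in range(num_queries):
                mask_prob = sigmoid(mask_logits[q][y][x])
                for c in range(num_classes):
                    scores[c] += class_probs[q][c] * mask_prob
            row.append(max(range(num_classes), key=scores.__getitem__))
        labels.append(row)
    return labels


# Toy input: two queries, two real classes, a 2x2 image.
class_logits = [[4.0, 0.0, 0.0], [0.0, 4.0, 0.0]]
mask_logits = [
    [[5.0, -5.0], [5.0, -5.0]],   # query 0 covers the left column
    [[-5.0, 5.0], [-5.0, 5.0]],   # query 1 covers the right column
]
labels = semantic_map(class_logits, mask_logits)
print(labels)  # [[0, 1], [0, 1]]
```

In this toy case each query claims one column of the image, so the left column is labeled with class 0 and the right column with class 1.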
Mask2former Swin Base Ade Semantic
Other
A Mask2Former model with a Swin-Base backbone, trained for semantic segmentation on the ADE20k dataset. It uses a unified framework to handle instance, semantic, and panoptic segmentation tasks.
Image Segmentation Transformers
facebook
2,811
0
Mask2former Swin Large Ade Panoptic
Other
A Mask2Former model with a Swin-Large backbone, trained for panoptic segmentation on the ADE20k dataset. It uses a unified paradigm to handle instance segmentation, semantic segmentation, and panoptic segmentation tasks.
Image Segmentation Transformers
facebook
2,625
4
Mask2former Swin Large Mapillary Vistas Semantic
Other
A large-scale Mask2Former model based on the Swin backbone network, designed for general image segmentation tasks, unifying instance segmentation, semantic segmentation, and panoptic segmentation.
Image Segmentation Transformers
facebook
5,539
3
Mask2former Swin Large Cityscapes Semantic
Other
A large-scale Mask2Former model based on the Swin backbone network, specifically trained for Cityscapes semantic segmentation tasks, adopting a unified architecture for various image segmentation tasks.
Image Segmentation Transformers
facebook
296.33k
24
Mask2former Swin Small Cityscapes Panoptic
Other
A compact Mask2Former model with a Swin-Small backbone, trained for panoptic segmentation on the Cityscapes dataset.
Image Segmentation Transformers
facebook
568
0
Mask2former Swin Large Cityscapes Panoptic
Other
A Mask2Former model with a Swin-Large backbone, trained for panoptic segmentation on the Cityscapes dataset.
Image Segmentation Transformers
facebook
772
2
Mask2former Swin Tiny Cityscapes Panoptic
Other
A Mask2Former model with a Swin-Tiny backbone, trained for panoptic segmentation on the Cityscapes dataset.
Image Segmentation Transformers
facebook
2,126
0
Mask2former Swin Tiny Coco Panoptic
Other
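Mask2Former is a Transformer-based unified image segmentation model that supports instance segmentation, semantic segmentation, and panoptic segmentation tasks. It uses a masked attention mechanism, which restricts each query's cross-attention to the region of its predicted mask, to improve performance. This checkpoint uses a Swin-Tiny backbone and is trained for panoptic segmentation on COCO.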
Image Segmentation Transformers
facebook
4,538
8
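The masked attention mechanism mentioned in the entry above can be illustrated for a single query. The sketch below, in plain Python, suppresses attention to pixels that the query's previous mask prediction marks as background. The function name `masked_attention_weights`, the 0.5 threshold, and the fallback to full attention when the mask is empty are assumptions of this sketch rather than a statement of the released implementation.

```python
import math


def masked_attention_weights(scores, prev_mask_probs, threshold=0.5):
    """Attention weights of one query over N pixel locations.

    scores:          raw attention logits (query-key dot products), length N.
    prev_mask_probs: the query's mask probabilities from the previous
                     decoder layer, length N.
    """
    # Keep only locations the previous mask marks as foreground.
    keep = [p >= threshold for p in prev_mask_probs]

    # Assumed fallback: an empty mask would leave nothing to attend to,
    # so attend everywhere instead.
    if not any(keep):
        keep = [True] * len(scores)

    # Background locations get -inf, which softmax maps to weight 0.
    masked = [s if k else -math.inf for s, k in zip(scores, keep)]
    peak = max(masked)
    exps = [math.exp(s - peak) for s in masked]
    total = sum(exps)
    return [e / total for e in exps]


# Four pixel locations; the previous mask covers only the first two.
weights = masked_attention_weights(
    scores=[1.0, 1.0, 3.0, 0.0],
    prev_mask_probs=[0.9, 0.8, 0.1, 0.2],
)
print(weights)  # [0.5, 0.5, 0.0, 0.0]
```

The third location has the highest raw score, yet it receives zero weight because it lies outside the predicted mask. That is the intended effect: each query concentrates on the region it is already segmenting.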
Mask2former Swin Small Coco Panoptic
Other
A Mask2Former model with a Swin-Small backbone, trained for panoptic segmentation on the COCO dataset.
Image Segmentation Transformers
facebook
240
1
Mask2former Swin Large Coco Panoptic
Other
A Mask2Former model with a Swin-Large backbone, trained for panoptic segmentation on the COCO dataset.
Image Segmentation Transformers
facebook
37.67k
30
Mask2former Swin Base Coco Panoptic
Other
The Mask2Former model based on the Swin backbone network, trained on the COCO panoptic segmentation dataset, adopts a unified paradigm to handle instance segmentation, semantic segmentation, and panoptic segmentation tasks.
Image Segmentation Transformers
facebook
45.01k
14
Oneformer Coco Dinat Large
MIT
OneFormer is a single Transformer-based architecture for image segmentation that supports semantic segmentation, instance segmentation, and panoptic segmentation. This checkpoint uses a DiNAT-Large backbone and is trained on COCO.
Image Segmentation Transformers
shi-labs
38
7
Oneformer Ade20k Dinat Large
MIT
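OneFormer, described by its authors as the first multi-task universal image segmentation framework, supports semantic, instance, and panoptic segmentation with a single model. This checkpoint uses a DiNAT-Large backbone and is trained on ADE20k.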
Image Segmentation Transformers
shi-labs
2,275
12
Detr Resnet 50 Panoptic
Apache-2.0
DETR is an end-to-end object detection model based on the Transformer architecture. This checkpoint uses ResNet-50 as the backbone network, is trained on the COCO dataset, and supports object detection and panoptic segmentation tasks.
Image Segmentation Transformers
facebook
9,586
137
Maskformer Swin Large Coco
Other
A MaskFormer model with a Swin-Large backbone, trained on the COCO dataset. MaskFormer handles instance, semantic, and panoptic segmentation with a single architecture.
Image Segmentation Transformers
facebook
849
24
Maskformer Swin Small Coco
Other
A small MaskFormer model based on the Swin backbone network, trained on the COCO dataset for panoptic segmentation tasks.
Image Segmentation Transformers
facebook
2,293
3
Maskformer Swin Base Coco
Other
A MaskFormer panoptic segmentation model with a Swin-Base backbone, trained on the COCO dataset. MaskFormer handles instance, semantic, and panoptic segmentation with a single architecture.
Image Segmentation Transformers
facebook
3,855
24
Maskformer Swin Tiny Coco
Other
A MaskFormer panoptic segmentation model with a Swin-Tiny backbone, trained on the COCO dataset. It uses a unified paradigm to handle instance, semantic, and panoptic segmentation tasks.
Image Segmentation Transformers
facebook
301
6